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1. Introduction 

Suppose that (X„)„gN is a strictly stationary sequence of random variables 
with marginal distribution function F . We assume that this sequence has an 
extremal index 6 e (0,1], that is, for each t > 0, there exists a sequence of 
levels (u„(T))„gN such that lim„_,tx) n-F (un{T)) = t and 

lim Pr(A/„ ^ u„(t)) = e-'^", 

n — >oo 

where F = 1 — F and A/„ = ma.x{Xi, . . . ,Xn}. The extremal index can be 
interpreted in a number of ways, the most common one being the reciprocal 
of the mean cluster size in the limiting point process of exceedance times over 
high thresholds. The probabilistic theory was worked out in (11), (13), (15), (9), 
(14), and (12). 

Our objective is to estimate 9 based on a finite stretch Xi, . . . , X„ from the 
time series. Inference about the extremal index parameter has been extensively 
studied. The three more common approaches are the blocks method, the runs 
method and the intcr-cxccedance times method. The two first methods iden- 
tify clusters and construct estimates for 9 based on these clusters. For each 
of these methods, there are two parameters which determine the clusters and 
consequently the estimates of 6*: a threshold and a cluster identification scheme 
parameter. The third method is based on intcr-cxccedance times and obviates 
the need for a cluster identification scheme parameter. Some references on es- 
timation of the extremal index using these three approaches are (7), (8), (17), 
(18), (6), (10) and (16) among others. 

In this paper we focus on the blocks method. Traditionally, it consists of 
partitioning the n observations into consecutive blocks of a certain length, say 
r. In each block, the number of cxccedances over a certain high threshold are 
counted, and the blocks estimator is then defined as the reciprocal of the average 
number of exceedances per block among blocks with at least one exceedance. 
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Blocks estimators are usually eonstructed by using disjoint blocks, for in that 
case the blocks can be assumed to be approximately independent. 

The main novelty in this paper is our proposal to use sliding rather than dis- 
joint blocks, that is, to slide a window of length r through the sample, yielding 
n — r + 1 blocks rather than just n/r disjoint blocks. Surprisingly, this sim- 
ple modification leads to a more efficient estimator with a smaller asymptotic 
variance. Moreover we provide estimators of the asymptotic variances of the 
estimators, which permits the construction of confidence intervals and the se- 
lection of variance-minimizing thresholds. We also provide a way to estimate 
and correct for the asymptotic bias of the estimators. 

In contrast to most previous papers but in accordance with (16), we assume 
that thresholds and block sizes are such that the expected number of excesses 
per block converges to a positive constant. In practice, the threshold is chosen 
as a large order statistic. However, mathematical treatment of such random 
thresholds requires complicated empirical process techniques. 

The content of the paper is organized as follows. In Section 2 we introduce 
the blocks estimators for the extremal index. In Section 3 we consider asymp- 
totic variances and covariances of the mean number of excesses per block and 
the empirical distribution functions of disjoint and sliding block maxima. We 
establish consistency and asymptotic normality of our estimators in Section 4. 
We discuss how to estimate and minimize their asymptotic variance in Section 5 
and how to reduce their bias in Section 6. In Section 7, we investigate the fi- 
nite sample behavior of the estimators on simulated data and we provide a case 
study. Proofs are spelled out in the appendices. 



2. The estimators 



For positive integer r, put 
Fr{u) ■.= VY{Mr ^u), Tr{u) := rF{u) and 9r{u) := _ ^°S-^'-(") ^ 

Tr[U) 

(2.1) 

It follows from the definition of the extremal index that 9 = limr^oo ^r(wr) 
where Ur = Ur (r) . The estimators of 9 to be proposed arc based upon empirical 
analogues of the functions Fr and Tr- 

For integer ^ s < r, put Ms,r '■= maxs<i^r^i- Note that Mr = Mo.r- 
The distribution function Fr of the block maximum Mr can be estimated using 
maxima of k :~ \ji/r\ disjoint blocks or using maxima of n — r+l sliding blocks: 

i=l 2=0 

One may wonder why the use of sliding rather than disjoint blocks should make 
a difference. After all, the n — r + l blocks in the definition of i^,f^(it) are 
overlapping and hence strongly dependent, even in the iid case. Nevertheless, 
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we will show in Proposition 3.1 below that the asymptotic variance of F^^{u) 
is typically smaller than the one of F^^j.{u). 
Writing 

^ k r ^ rk 

T„,.(u) := ^ E E Ii^rU-l)+^ > - ^ E ^(^^ > (2.2) 

j — 1 i—1 2—1 

the disjoint and the sliding blocks estimators of the extremal index can now be 
defined as follows: 

As above, the sliding version will turn out to be more efficient than the disjoint 
one, see Corollary 4.3. 

The estimators require the choice of two tuning parameters: the threshold 
u and the block size r. If u is equal to the [fcrj-th largest order statistic of 
Xi, . . . , Xn, the disjoint blocks estimator is the same as 9[j\ in (16). As men- 
tioned in Section 1, the mathematical treatment of such random thresholds is 
intricate and requires empirical process techniques. For the sake of simplicity, 
the threshold sequence (u, )reN will be assumed to be deterministic. Compar- 
ing our Corollary 4.3 with (author?) (16, Corollary 4.2), it follows that this 
simplifying assumption does not make any difference asymptotically. 



3. Asymptotic variances and covariances 

The disjoint and sliding blocks estimators for 6 are functions of F^''j,{u), F^^j,{u), 
and fn,r(u). We shall need to find the asymptotic variances and covariances of 
the latter three estimators. Most importantly, we will show that F^^^^{u)—F^^{u) 
has a non-negligible asymptotic variance and is asymptotically uncorrelated with 
F^j.{u). As a result, the sliding blocks estimator for Fr{u) is the most efficient 
convex combination of the disjoint and sliding blocks estimators for Fr{u). The 
proofs of the results in this section are to be found in Appendix A. 

The maximal correlation coefficients of the process (A'„)„gN relative to the 
threshold u are defined by 

PnA'^) ni^^x sup I corr(^,77)|. 

*=!,...,«-; 5gL2(.7^i,f(«)) 

Here !Fa,b{u) is the cr-field generated by the events {Xi < u} for i e {a, . . . , 6}, 
and L2{T) is the space of J^- measurable square- integrable random variables. 
Obviously, the random variables ^ and 77 in the definition of Pn.i(u) should have 
positive variance. For comparisons with other mixing coefficients, see e.g. (2). 
Here we just wish to note that 

Pn,;(w) > an,i(u) max sup | Pr(A n B) - Pr(A) Pr(B)|. 

Be.Ft+,,„(u) 
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The coefficients q:„.( underiie the condition calicd A(u„) in (9) and are tliem- 
selves greater than the coefficients introduced in (11) yielding Leadbetter's 
D[un) condition. Since the upper bounds we will impose on pn^i will trivially 
imply the same upper bounds on an,i, the results in (9) become available to us 
as well. 

Let r„ and In be positive integer sequences such that, as ti — > oo, 

n 

= o(r„), r„ = o(?i) and ^ p„j(ur„) = o(r„). (3.1) 

i=i„ 

Note that the assumptions imply '}Z^=i Pn,i{ur„) = o(rn) and that the final 
assumption is implied by knPn,iniur„) — > as n ^ oo, where fc„ = \n/r„\. 

Proposition 3.1. Let {X„)neN be stationary with extremal index 9 and let 
(ur)reN be a sequence of thresholds such that rF{ur) ^ r £ (0,oo) as r — > oo. 
// (3.1) holds, then, as n oo, denoting a :~ 9t, 

fc„var(F„^j,jM,J) ^ e-"(l-e-"), (3.2) 
fc„ var(F„"|^ Jm^ J) ^ 2a-ie-" (l - (1 + a)e-") , (3.3) 
fc„cov(F,'^,|,Ju,J,F„^j,Jw,J) ^ 2a-ie-"(l- (l + a)e-"). (3.4) 

By Proposition 3.1, we can write F^^^^{ur„) = -P7i'r„ ) + £ni the random 
term e„ having mean zero, being asymptotically uncorrelated with F^^^{ur^), 
and having non-negligible asymptotic variance. As a consequence, the sliding 
blocks estimator of the distribution function of the block maximum is more effi- 
cient than the disjoint blocks estimator. The two asymptotic variance functions 
as well as their ratio are shown in Figure 1. Observe that the relative efficiency 
of the disjoint versus the sliding blocks estimator is decreasing in a. For a 0, 
the clusters of exceedances become very sparse, and the two estimators are 
asymptotically equivalent. 

In order to get the asymptotic covariances between f„,r„ (mt-„) in (2.2) with the 
disjoint and sliding blocks estimators for Fr{u), a somewhat stronger condition 
on the maximal correlation coefficients is needed: 

^Pn,iiurJ = oirl/"^), ?i->oo. (3.5) 
1=1 

Proposition 3.2. If in addition to the conditions of Proposition 3.1 also (3.5) 
holds, then as n oo, 

kn COv(F^J^^ (Ur„), T„,r„ ("r„)) 
kn COV (^;t!r„ i'^r„ ),Tn,r„ ("r„ )) 

Finally, in order to find the asymptotic variance of Tn,r„{ur„), another addi- 
tional assumption is needed: there exists a positive integer sequence (s„)jgN and 
a probability distribution (7rj)jgN on the positive integers such that as n ^ oo 

ln = o{s„), s„=o(r„), Pn,l^{UrJ = 0{Sn/n) (3.7) 
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Disjoint versus Sliding Blocl<s 



Disjoint versus Sliding: Asymptotic Relative Efficiency 




Fig 1. Left: Asymptotic variance functions of disjoint and sliding blocks estimators of Friu), 
see Proposition 3.1. Right: Asymptotic relative efficiency of disjoint versus sliding blocks esti- 
mators of Fr(u). 



as well as, writing Ns{u) = > u), 

Pr{Ns^{ur^) = j I > Ur„} ^ TTj for all j e 



(3.8) 



The distribution (7rj)jgN is called the cluster size distribution; it describes the 
limiting probability distribution of the number of threshold excesses within the 
block Xi, . . . , Xs^ given that there is at least one such excess. The second part 
of (3.8) is a uniform integrability condition ensuring that the first two moments 
of the finite-sample cluster size distribution converge to the proper limits. Note 
that Pr(Ms„ > u,-,,) ^ s„ Pi{Xi > Ur„) ^ as n ^ oo while 

Under the above conditions, the asymptotic distribution of Nj.^{ur„) is com- 
pound Poisson (9, Theorem 5.1): 



NrAUrJ^N■.= J2C^ 



(3.9) 



where v is a, Poisson(0T) random variable and (Ci)igN is a sequence of positive 
independent and identically distributed integer-valued random variables from 
the cluster size distribution, independent of v. Note that E(Ci) = '^j>ij^j = 
6^^. Moreover, 



E(A^) - T, 



var(A^) =aJ2f^j 



(3.10) 
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Proposition 3.3. If in addition to the conditions oj Proposition 3.1 also (3.7)- 
(3.8) hold, then as n oo, 

oc 

fc„ var(f„^r„(ur„)) — > var(A^) ^a^^j^TTj. (3.11) 
4. Weak consistency and asymptotic normality 

The main result of this paper is the joint asymptotic normality of the disjoint 
and sliding blocks estimators for 6 in Corollary 4.3. The proofs of tlie results in 
this section are to be found in Appendix B. Write 

mp:=Y,f^3' pe{l,2}. 

Recall that mi =9^^. 

Theorem 4.1. Let (Xn)neN be stationary with extremal index 9 and let {ur)reN 
be a sequence of thresholds such that rF{ur) —>■ t Cz (0, oo) as r —^ oo. // 
conditions (3.1), (3.7) and (3.8) hold, then as n oo 

^«!r„ ("r„ and 9^1^^ iur„ ) ^ 9. 

In order to get asymptotic normality of tire estimators, we will need an ad- 
ditional technical assumption: there exists a constant p witlr p > 1 such that as 
n — > oo, 

E[<f (u,J] = 0(1). (4.1) 

We first state joint asymptotic normality of /"^^^(u), F^^{u), and fn^r{u). 
Joint asymptotic normality of 9'^^^{u) and 9^l^{u) tlien follows by the delta- 
method. 

Theorem 4.2. Let (X„)„gN be stationary with extremal index 9 and let (ur)reN 
be a sequence of thresholds such that rF{ur) —^tG (0,c») as r oo. If 
conditions (3.1), (3.5), (3.7)-(3.8) and (4.1) hold, then 



iiV(0,S) 



(Fnki^rJ~FrAurj\ 
\ T„,r„ {Ur„ ) - rr„ (Ur„ ) / 

where S — (o'y)f is symmetric and, writing a = t9, 

<Jii = e-"(l-e-"), 

CT22 = <Ti2 = 2a~^e-°' + a)e-") , 

CTai = 0-32 = -re~", 

(T33 = am2. 
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Recall Or{u) in (2.1). In order to control the bias of the extremal index esti- 
mators, assume that the block sizes are sufSciently large so that as n — > oo, 



9,„(u,J-0 = o(1/v/A~). 



(4.2) 



The asymptotic variance of the extremal index estimators will depend on 6, t, 
and the squared coefficient of variation of the cluster size distribution [T:j)j^^: 



2 ._ 



TO2 



TOT 



Corollary 4.3. Under the conditions of Theorem 4-2 and (4.2), as n ^ oo, 

where V = ivij)i j=i 'is symmetric and 



N(0,V) 



Vll 



2„2 



V22 



Vl2 = 2^ 



0^- 

a 

1 — a 



a 



The asymptotic variance of the disjoint blocks estimator corresponds with 
the one for the same estimator but at a random threshold (order statistic) in 
(author?) (16, Corollary 4.2). It is worth noting that V22 ^ vii- As a result, the 
sliding blocks estimator is more efficient than its disjoint version. Even more, the 
most efficient convex combination of the disjoint and sliding blocks estimators 
is the sliding blocks estimator itself. 



5. Estimating and minimizing the asymptotic variance 

For a fixed ^ 0, the asymptotic variance functions of \/k^i{On,r„/S — 1), 

{a^2(e" — 1 — a) + a^^c^ , disjoint blocks, 

./ n 10 (5.1) 

2a-^{e°'-l-a-\\+a-^c^, sliding blocks, ^ ' 

are convex and possess unique global minima. These minima and the values of a 
for which they are attained can be computed numerically, see Figure 2. Hence, 
given an estimate of c^, we can estimate the respective optimal values for a, 
divide by an estimate of 9, and thus obtain estimates of the optimal r to be 
used for the disjoint or sliding blocks estimators. Given such estimates, we can 
for a given threshold u estimate the asymptotically optimal block lengths r and 
vice versa. 

The missing element in this procedure is an estimate of c^. Knowledge of 
is also needed when one wants to construct asymptotic confidence intervals 
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Disjoint versus Sliding: Minimal Asymptotic Variance Disjoint versus Sliding: Optimal alpha 




0.0 0.5 1.0 1.5 2.0 0.0 0.5 1.0 1.5 2.0 



„2 „2 
C C 

Fig 2. Minima (left) and minimizers (right) of the asymptotic variance functions in (5.1) of 
the disjoint and shding blocks estimators for d. 

for 6 based on Corollary 4.3 or estimate the asymptotic bias of the extremal 
index estimators in Section 6 below. In addition, the quantity is interesting 
in its own right as a measure of dispersion of the cluster size distribution (7rj)jgN. 
Since the mean cluster size is equal to mi = 0^^, for which consistent estimators 
are available, we can focus here on estimating the cluster-size variance m2 — m\ 
or the second moment m2. 

A first possible strategy to estimate the cluster-size variance is to partition the 
threshold exceedances into clusters and estimate the cluster size variance by its 
empirical counterpart. However, this is difficult for two reasons: (a) the rareness 
of the clusters, and (b) the uncertainty on how to group the observed excesses 
into clusters. For nonparametric estimators of the cluster-size distribution, we 
refer to (5) and (16). 

On the other hand, we can remain in the spirit of the paper and propose 
a sliding blocks estimator. Recall Nr{u) ~ J2i=i^{-^i > its compound 

Poisson limit N in (3.9). Put crf.{u) := var(iVr('u)) • Under an appropriate uni- 
form integrability condition, we have by (3.10), as r — > oo, 

cr'^{uj.) — var(iVr(ur)) va.r{N) ~ mn2. 

Recall Na,b{u) :~ J2a<i<b ^(-^i > ^'^^ ^ threshold u and a block size r, we 
define 

^ n—r 
Nn,r{u) ■■= — Ni,i+r{u), 

n — r + 1 ^-^ 

1=0 
^ n — r 

'^"^'-("^ ^= ^r-TE(^M+.(«)-^«..("))'- 

i=0 

We set the denominator equal to n — 2r -f 1 in order to reduce the bias of af^ ^ (u) 
from 0{l/k) to 0(l//c^) in case {Xn)n£n is iid. 
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The sliding blocks estimator for — 7712/ ml — 1 = 6^1712 — I — 9 va.r{N)/T—l 
then becomes 

We derive the consistency of ^ (u) under a condition on the fourth moment of 

E[iV4(w,)] = 0(1), r->c3o. (5.2) 

At the price of a longer proof involving a characteristic function argument, 
condition (4.1) on the moment of order 2p (with 1 < p < 2) would be sufhcient 
as well. 

Proposition 5.1. // (3.1), (3.7)-(3.8), (4.1) and (5.2) hold, then cl ^ {u^ ) ^ 
The proof of Proposition 5.1 is given in Appendix C. 
6. Reducing the bias 

Recall 9r{u) as in (2.1) and let dn.r{u) denote either the disjoint or the sliding 
blocks estimator. The bias of 9n.r{u) can be decomposed into two parts: 

ndn,r{u)] -9^ {¥.[9n,r{u)] - 9r{u)) + {9r{u) - 9). 

The component 9r{u) — 9 is inherent to the process {Xn)n&i itself. For the 
three examples below it holds that if (ur)r6N is such that rFiur) = 0(1) then 
9r{ur) — 9 = 0{l/r) as r ^ oo. 

The presence of the component E[6'„,r(u)] — 9r{u) stems from the fact that 
the mean of a function of a random vector is in general unequal to this function 
applied to the mean of the random vector. By a second-order Taylor expansion, 
it follows that 

n0n,riu)]~9Au)- MFn,r{u)) , COV (4,. (^) , r„, (u)) 



2E'^[FnAu)] E[t„,,(m)] E[Fn,r(u)] E'^[Tn,r(u)] 

l0gE[F„,r(u)] 



var 



{Tn,r{u)). 



By the above expansion and Propositions 3.1 and 3.2, we obtain, as n oo. 



fcn(E[0„,r„(Wr„)] " 9r„{UrJ) ^ M = 



Mdj := 6l(2a)-i (e" - 1) + a-^9c^, 
Msi ■.= 9a-^{e" ^l~a) + a~^9c^, 

(6.1) 



for the disjoint and the sliding blocks estimator, respectively. Note that ^ 
Msi ^ Mdj • If in addition 

6'r„(Ur„) -6* = 0(l/fc„) (6.2) 
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then it follows that, as n ^ oo, 

Just like the asymptotic variances in Corollary 4.3, the asymptotic biases of 
the disjoint and sliding blocks estimators in (6.1) are functions oi 0, a = 6t, and 
c^. Given consistent estimators of these three quantities, we can estimate /i and 
then correct the extremal index estimators by subtracting fl/k. Observe that this 
procedure has to do with the 0{l/k) asymptotics of the estimators only, whereas 
minimization of the asymptotic variance affects the 0{1/Vk) asymptotics. 

Note that condition (6.2) is slightly stronger than (4.2). In case 9r„ {ur„) — 9 = 
o{l/r„), as in the three examples below, (6.2) is equivalent to fc„ = o(r„), that 
is, n^/^ = o(r„). In contrast, for condition (4.2), the requirement is only that 
kn = o{rfJ, that is, n^^^ — o(r„). 

Example 6.1 (IID sequence). Let (X„)„gN be a sequence of independent ran- 
dom variables with a common, continuous distribution function F. Then = 1 
and 

r{Or{ur) - I) ^ -, r^oo. 

Example 6.2 (Max Auto-Regressive Process). Let (W„)„gN be a sequence of 
independent, unit-Frechet distributed random variables. For < ^ 1, let 
Xi = Wi/9 and Xn ~ max{(l — 9)Xn-i; Wn}, n ^ 2. The extremal index of 
the sequence is equal to 9 and 

r{9r{ur)-9) ^^ + (1^9), r^oo. 

Example 6.3 (Moving Maximum Process). Let {Wn)neN be a sequence of 
independent, unit-Frechet distributed random variables. Let Xi = 2Wi and 
Xn = max(W„-i, Wn). The extremal index of the sequence is equal to 6' = 1/2 
and 

r{9r{ur) — 9) ^ —, r ^ oo. 



7. Numerical examples 
7.1. Simulation study 

The finite sample properties of the disjoint and sliding blocks estimators for 
the extremal index are compared in a simulation study. Sequences of length n = 
10 000 are simulated from Max Auto-Regressive processes with 9 = 0.25, 0.5, 0.75 
and 1. For each sequence the estimators ^jJVl"") ^^'^ ^nr('") computed for 
five block sizes and two thresholds. The block size is r = 25, 50, 100, 200 or 400. 
The threshold u is the [fcrj-th largest order statistic and is defined by either 
a default value of r = 1 or the estimate of the optimal value of t described in 
Section 5. The initial estimates of and 9 required in the latter case are based 
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~i 1 1 1 r ° ~n 1 1 1 r 

25 50 100 200 400 25 50 100 200 400 




~i 1 1 1 r ° ~n 1 1 1 r 

25 50 100 200 400 25 50 100 200 400 




Fig 3. Monte Carlo approximations to the biases (left) and standard errors (right) of the 
disjoint (dashed) and sliding (solid) blocks estimators for the extremal index with default 
(thin) and optimal (thick) choices for r plotted against block size r on a logarithmic scale. 
Data are simulated from Max Auto-Regressive processes with 9 = 0.25, 0.5, 0.75 and 1 (top 
to bottom). 
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Fig 4. Monte Carlo approximations to the biases of the bias-corrected disjoint (dashed) and 
sUding (sohd) blocks estimators for the extremal index with default (thin) and optimal (thick) 
choices for r plotted against block size r on a logarithmic scale. Data are simulated from Max 
Auto-Regressive processes with 9 = 0.25 (top-left), 0.5 (bottom-left), 0.75 (top-right) and 1 
(bottom-right). 



on the threshold when t — 1. Monte Carlo approximations to the properties of 
the estimators are computed from 10 000 simulated sequences. 

Figure 3 shows the biases and standard errors of the estimators. Biases tend 
to be positive and smallest at intermediate block sizes while variances increase 
with block size. Sliding blocks always yield lower standard errors than disjoint 
blocks. There is also evidence that sliding blocks yield larger biases than disjoint 
blocks when r is small and smaller biases when r is large. Optimizing t tends to 
yield lower variances than the default r = 1, but also larger biases when 6 < 1. 
This is explained by the fact that the estimated values for the optimal r tend to 
exceed 1 except when 9 = 1. Example 6.2 suggests that increasing r increases 
the bias. 

The effect of the bias correction described in Section 6 is shown in Figure 4. 
There is little improvement for small block sizes, but biases are reduced signifi- 
cantly and stabilized for larger block sizes. The impact on the standard errors 
is negligible (not shown). 

The positive biases of 6f^^^{u) and 6,1 ^{u) can lead to poor coverage proper- 
ties (not shown) of confidence intervals for 9 based on the asymptotic Normal 
distribution of Section 4. Lower and upper confidence limits tend to be too high 
when r is small but coverage improves when r is large. Coverage is also affected 
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Fig 5. Daily log returns of the FTSEIOO index from 25 December 1968 to 12 November 2001. 



by underestimation of standard errors when 6 <1 (not shown). 

These simulations were repeated for the doubly stochastic process of (17) and 
for ARCH(l) processes; see (3) and (author?) (4, Chapter 8). Results for the 
doubly stochastic process were very similar to those reported above. Results for 
ARCH(l) processes were also similar but the improvement in variance afforded 
by sliding blocks was less clear. Qualitatively similar results were found when 
the simulations were repeated with n — 1000 and r = 5, 10, 20, 40 or 80. 



7.2. Case study 

The extremal index is now estimated for a financial time series: daily log returns 
of the FTSEIOO index between 25 December 1968 and 12 November 2001. This 
series was analysed previously by (10) and is plotted in Figure 5; the data were 
kindly passed on by Jonathan Tawn. Clusters of large, negative returns can be 
financially damaging so estimates of the extremal index for the negated series 
are plotted against block size in Figure 6. Two sliding blocks estimators are 
compared: both employ the bias correction but one uses the default value t — \ 
while the other uses estimated optimal values r = fopt . Thresholds are the [fcrj - 
th largest order statistics so that the proportion of data exceeding the threshold 
for block size r is r/r. The lower horizontal axis in Figure 6 is therefore a 
transformation of the threshold used when t — 1, and coincides with the scale 
used by (10). The upper horizontal axis represents the same transformation of 
the threshold when t = fopt- These latter thresholds are lower because fopt ~ 5 
for all but the smallest block sizes. 

The point estimates from the two sliding blocks estimators are similar and 
both stablize near 6 = 1/3. Estimates from the intervals estimator of (6) are 
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1 2 

-logi„[-log(1-1/r)] 



Fig 6. Sliding blocks estimates (solid) of the extremal index for the negative daily log returns 
plotted against block size r on a log-log scale. The estimates use default (t = 1; thin) and 
optimal (r = Tbpt; thick) choices for r. Intervals estimates (dotted) arc based on the threshold 
obtained when r = 1. Shading indicates pointwisc 90% confidence intervals based on the 
asymptotic Normal approximation for the sliding blocks estimates. The top axis represents 
-logio[-log(l - T'opt/r)]. 



also shown and differ slightly but are consistent with an extremal index of one- 
third once sampling variation is taken into account. However, these values are 
approximately half those obtained by (10) with a two-thresholds estimator. The 
confidence intervals in Figure 6 are computed using the estimated standard 
errors for the sliding blocks estimators with no bias correction. The confidence 
intervals when r = fbpt are often much narrower than when t = 1 owing to the 
lower thresholds mentioned above. 
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Appendix A: Proofs for Section 3 
A.l. Proof of Proposition 3. 1 

Asymptotic variance of F^\^{ur^^) . By stationarity, 

var(F„dj,H) - ^Fr{u){l-Fr{u)} 
2 

+ p {Pr(A^(<-l)r,^r A^O-l)rjr ^ u) - F'^ {u)} 

= \Fr{u){l - Fr{u)} + ^i^^{F2.(u) - F^ {u)} 

2 ''"^ 

By definition of the extremal index, Fr^{ur„) e~'^^ and F2r^{ur^) e^^'^^ 
as n ^ oo. Hence (3.2) will follow if we can show that 

7^ ^ (fcn - s){Pr(Mo,r„ ^ Ur„, A^sr„,(s+l)r„ < Ur„) " ^r„(Wr„)} 

as n — > oo. But since the maximal correlation coefficient pn,i{u) is decreasing in 
I, the left-hand of the previous display is bounded in absolute value by 

Pn.(s-l)r„('«r„) ('»r„ ) ■ 

By hypothesis, this converges to zero as n ^ oo. □ 
Asymptotic variance of F^^ (^^r„)- By stationarity, 



var(i^;^;,(w)) = t^FAu){l-Fr{u)} 



1 

n — r + 1 

^ (n-r + 1)^ ^ ^ cov(/(A/,,,+^ u), /(Mjj+^) i 
The sum on the right-hand side of the previous display can be written as 



(n-r-hl)2 ^ 

^ ^ S — 1 



— ^(n -r + l-s) cov{l{Mo^r ^ u), /(M,,,+^ < u)) 



T-Efl '-—){Fs+riu)~FAur} 

1 r V n — r + 1 / 



n-r + ^ . 

s—l 



2r 1 



+ — - V 1 —] COY {I [Mo, r^u), I {Ms. s+r i^u)) 

s—r+l 
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By dominated convergence, as n — > oo, 



- E f 1 '^—7] {Fs+r^ ("r„ ) - Fr,^ (u, J^} 



s — 1 ^ 

t9 

Hence (3.3) will follow if we can show that, as n — > oo, 

— 1 ZT cov(/(Afo,r„ u,.J,/(Af,,,+^„ s; Urj) -> 0. 

But this follows from the assumption that 'i'^'^^^^^pn.i{ur^) ^ as n ^ 

00. □ 

Asymptotic covariance of F!^^^^{uj.^) and F^^^{uj.^). We have 
= cov(/(Afo,. ^ + cov(/(M(fc_i),,fc, 

k—1 n — r 

+ cov(/(Af(,_i)„, u),I{Mj,,+r ^ u)). 

^ 1/2 

The first two terms on the right-hand side are bounded by {var(F^'^(M))} ; in 
view of (3.3), they will not contribute to the limit. The final term on the right- 
hand side of the previous display can be decomposed into two pieces, I + II say, 
according to whether [i — 2)r j < ir or not. For the first term, the union of 
the intervals of integers {{i — l)r + 1, . . . , ir} and {j + 1, . . . , j + r} is again an 
interval of integers; by stationarity, 

k-2 

/ = ) COY [I {Mr 2r^u), I{Mj , u)) 

n — r + 1 ^-^ 

3=0 
2r — 1 

n — r + 1 r ^-^ 

3=0 

Adding the subscript n to indicate the dependence on n, we get 

r-2 ^ ,1-e-" 



lim /„= / (e-{(2-«)vn}a_g-2a)^^^2fe-"- 



a 



The second term can be bounded as follows 

2{k~-2) 



II ^ 



n — r + ^ 

1=1 



Since r„ ^ J2i=i Pn,i{ur„) — > by assumption, we get //„ — > as n oo. □ 
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A. 2. Proof of Proposition 3.2 

Asymptotic covariance of F^\^{ur^) anc? f„^j.„(ur„)- We have 

^ k kr 

kC0Y[F^]^{u),fn,r{u)) = ' T^^CO-v[l{Mf^i_-i^)^^„ «C u),I{Xj < u)) . 

i=l j=l 

We split the sum according into two pieces, I + II say, according to whether 
(i — l)r < j ^ r or not. By stationarity, the first term is equal to 

r 

/ = -^cov(/(A/o,, SC u),I{X, s; j)) = -r{l - F{u)}Fr{u). (A.l) 
i=i 

Adding subscripts n to indicate the dependence on n, we get /„ — re~" as 
n oo. The second term, //, can be bounded in absolute value as follows: 

n 

\II\ s; 2{var(/(AVs;w))var(/(Xi ^w))}'/'^p„j(?/) 

1=1 

n 

^ 2{l-F(u)}V2^p„/w). (A.2) 
(=1 

Assumptions (3.1) and (3.5) now imply that //„ —i-Oasn—f oo. □ 
Asymptotic covariance of Ff^^^^{ur^) and Tn,r„(ur„)- This time, we have 

^ n~r kr 

kcOv{F.^],{u),Tr,,riu)) = - ^ _ ^ ^ ^ ^ C0v(/(M,,,+, ^ u) , I {X j ^ u)) . 

" ^ 1=0 3 = 1 

We split the sum according into two pieces, I + II say, according to whether 
i < j ^ i + r or not. The first term, / is the same as in (A.l) and so gives rise to 
the same limit. The second term, //, admits the same bounds as in (A.2) and 
hence is asymptotically negligible. □ 

A. 3. Proof of Proposition 3.3 

Recall Ns{u) = Yfi^^ I{X, > u). We have 

fc var(f„^r(u)) = i var(iVrfc(u)). 

For integer ^ a ^ 6, put Na,b{u) :~ '^a<i^b H-^i > '^)^ the sum being zero if 
a = b. Fix integer 1 ^ I < s < n and write m [rfc/sj . We have 

m m 
Nr„kr,{UrJ = '^^(i-l)sr,,is„-l,JUrJ + ^ A^is^ ,is„ (Ur„ ) 
i=l 1=1 

+ ^rn„Sn-rnkn{Ur„) =• An + -Bri + Cn • 



C. Y. Robert, J. Segers and C. Ferro/Sliding blocks estimator for the extremal index 18 

By the Cauchy-Schwarz inequality, it is sufRcient to show that, as n — s- cx), 

(l//c„) var(^„) am2, (l/fc„) var(S„) ^ 0, (l/fc„) var(C„) 0. 

Before we treat these three terms, it is useful to note that the assumptions imply 
that 

var(A^i^ (Mr„)) = o(s„ Pr(Xi > Mr„)), n — > oo, (A. 3) 

which in turns implies 

(l/fc„)m„ var(A'';^(ur„)) ^ 0, ?i ^ oo. (A. 4) 

[Proof of (A.3): var(iVi„ < E[A^f„K„)] < Sn Pr(^i > Wr„ ) E [A^f^ (wr„ ) I 
Mg^ > Uj.^] and the final expectation tends to zero by uniform integrability 
and Vv{NiJurJ > | M^^ > u^-J ^ Z„ Pr(Xi > UrJ/PriM^^ > UrJ ~ 

{ln/Sn)0-^ ^ 0.] 

The term An- By stationarity, (l/fc„) var(A„) = /„ + //„ with 
/„ := (l/fc„)m„ var(iV5„_/„(u,.„)), 

//„ :=2(l/fc„) COv(7V(j_i)s„,js„_;„(Mr„),A^0-l)s„js„-!,.(Ur„))- 

We first treat /„. We have 

= F.[N^JurJ I > u^J Pr(A^„ > UrJ - {s„ Pr(Xi > UrJ}^. 

Since m„ ~ n/sn and Pr(Afs„ > Ur„) ~ Sn Pr(Xi > Ur^)9 as ^ oo, we find 

(1/A:„)m„ var(iVs,,(ur,J) — > 9Tm2, n — > oo. 

Since Ns^{ur„) = Ns^^i^^{ur,J + Ns^-i^^s„iur„) and since Ns^^^i^^s^iur^) and 
7V;^(ur„) have the same distribution, the previous display and (A. 4) imply that 
/„ 9Tm2 as n — > oo. 

Next we treat //„. We have 

\IIn\ ^ 2(l/fc„)m^ var(A^s,.-i„(wr„))p„j„(ur„)- 

In view of what we obtained for /„ and since rnnPn,i„iur„) — > 0, we conclude 
that //„ — > as n — > oo. 

The term Bn. By stationarity 

(l//c„) var(B„) 

^ (l/fc„)m„ var(A';„(ur„)) + 2(1/A;„)to^j var(A';„ (ur„))p„j„ (ur„)- 

By (A. 4) and since mnPn,i„{ur^) 0, we obtain that var(i?„) ^ as 

n — > oo. 
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The term C„. By stationarity, 

(l/fc„) var(C„) = (l/fc„) var(7V„_,„„^„(ur„)) 

(i/fc„)E[Af^„„,„KJ] (i/fc„)E[A^,'„KJ] 

= (l/fc„) Pr(M,„ > UrjE[N^^{urJ I A/.„ > u^J. 
By assumption, the limit as n ^ co is zero. □ 

Appendix B: Proofs for Section 4 
B.l. Proof of Theorem 4-1 

By Propositions 3.1 and 3.3 in combination with Tchebychev's inequality, it is 
not difficult to see that as n ^ oo, 

\Fn,rJUrJ-Fr^{Urj\ + \K\rJUrJ~FrJUrj\ + | T„,r„ K J - T^„ K J | ^ 0. 

By definition of the extremal index, Fr^{ur^) — > e^^'^ and Tr^{ur^) ^ r as n — > 
oo. The result follows by continuity of {x,y) i-^ — log{x)/y on (0,00) x (0,oo). 
□ 

B.2. Proof of Theorem 4.2 

Write Zn = iZn,l,Zn,2, Zn.sV ^ith 

ZnA = \/k^{Fn,r„{'UrJ " ^r„("r„)), ^n,3 = ^('i'«,r„ (Wr„ ) ^ Tr„(Wr„)), 
Zn^2 = Vk^.{F^n,rS'^r„) " i^r„("rj). 

By the Cramer- Wold device, it is sufficient to show that for a — (ai, 02, 03)^ € 
Note that 

Zn,l ^= ^ ^n,3 = ^= ^ ^»,r„, 



1^1 



" t=o 

where for i S {1, . . . , A;} and t G {0, . . . , n — r}, 

ir 

t=(j-l)r+l 

Il\ = I{Mut+r ^ Ur) ~ Fr{Ur). 
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The idea of the proof is as follows: By clipping out certain terms in the defini- 
tions of Zn^j^ the latter can be viewed upon as sums of approximately indepen- 
dent random variables. Asymptotic normality then follows from the Lindeberg- 
Feller central limit theorem for triangular arrays. 

Let k* £ {1, . . . , fc — 1}. Construct a partition of {1, . . . ,k} into subsets of 
size fc*, with two adjacent such subsets separated by a singleton. The number 
of subsets of size k* that can be formed in this way is g = [{k + l)/(fc* + 1)J ■ 
We have 

q~l A ^ 

Let fc* = fc* be such that fc* — > oo but fc* — o{kn) as n — > oo. The final 
two terms on the right-hand side of the previous display are negligible as their 
variances tend to zero: the variance of the second term on the right-hand side 
is of the order 

O var(jfj,J + 2|^p„.,„(w.J var(jfj,j) = o(l), 
where we used g„ — o{kn), var(/f^^^) — 0(1), and 

Pn,r„("r„) = 0(l/fc„) (B.l) 

[Proof of (B.l): by (3.7) Pn,r„{ur„) ^ PnjA'^r^) = o(sn/n) and s„ = o(r„)]; 
similarly, the variance of the third term is of the order 

O ("2^ var(/fj,J + 2i^p„,,,.(w,Jvar(/fj,j) - o(l). 

As a consequence, 

, 9„ i(fc:+i)-i 

V « j = l,=(j_l)(fe.+l) + l 

In a completely similar way, just replacing J^i by Ni^r, we can also show that 

, ?„ j(fc;+i)-i 



V ^" J=lj=(j-l)(fc.+l)+l 

a crucial element here is that var(iVi^r„) — 0(1), which follows from (4.1). 

Next, construct a partition {l,...,n} into fc blocks of size r and in case 
fcr < rt a final block of length n — kr. Form subsamples by taking unions over 
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k* consecutive blocks of size r, two consecutive subsamples being separated by 
a single block of size r. The number of subsamples that can formed in this way 
is again q = [(fc + l)/(fc* + 1)J , the jth subsample being 

i(fe-+i)-i 

= {(i - l)(fc* + l)r + 1, . . . , j(fc* + l)r - r). 

In the definition of the sliding-blocks estimator, retain only those t e {0, . . . , n — 
r} such that the (sliding) block + 1, . . . , i + r} is contained entirely in one of 
the subsamples Sj^k* ,r- In other words, discard those t such that {i + 1, . . . , t + r} 
has a nonempty intersection with one of the q~ 1 blocks of size r separating 
two consecutive subsamples or with the remaining part of the sample after the 
final subsample Sq^k* .r- The values of t to be retained are then given as follows: 
for j e {1, . . . ,17}, 

+ 1, ...,< + r} C Sj^k'^r if and only if (j - l)(fc* + l)r ^ t ^ j{k* + l)r - 2. 
We find 



n-r /77 q j(k* +l)r-2r 



t=0 i=l t=(i-l)(fe*+l)r 



/T:?-! ](k'+l)r-l rr 

E ^t + v E 

3 = 1 t=j{k-+l)r-2r+l t=q(k' +l)r-2r+l 



t,r- 



Again, let k* — fc* be such that /c* ^ 00 and /c* — o{kn) as ?! 00. Asymp- 
totically, the variances of the final two sums tend to zero: the variance of the 
second sum on the right-hand side is of the order 



O 



^{<?n + qlPn,r^iUrJ}rl^ = O + |^Pn,r„ (Ur„ )^ = o(l); 



and since the number of terms in the third sum on the right-hand side is not 
larger than 2k'*r, the variance of that sum is of the order the variance of that 
sum is of the order 



0('^Krl)^o(f]^oil). 



As a consequence. 



lE E n]r.+o,{\). (B.4) 



" ''""^ j=lt={]-\)(k'^ + l)r 
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Equations (B.2), (B.3) and (B.4) can be summarised as 

qn 



Zn,v = y^^Kj,!, + Op(l), t;e {1,2,3}, 



with for j e {1, . . . ,g„}. 



■dj 



i=0--i)(fc;+i)+i 

J— j(fe;+l)r„-2r„ 

^ _ V 'Znfcn fsl 

n — r„ + 1 -^-^ 

i = a-l)(fc-+l)r„ 

i=0-i)(fc;;+i)+i 

As a consequence, 

0-lZn,l + a2-Z^n.2 + 03^11,3 — Cn,J + Op{\-) 

with — ai^n,j,i + a2^n,j,2 + a3^n,j,3- Note that £,n.j,v is measurable with 
respect to the cr-field generated by the events {Xt ^ Ur} with t ranging over 
the jth subsample Sj^k^^r„- Since these subsamples are separated by at least 
one block of size r„ and since Pn,r„iur„) — o(l/fc„) = o(l/g„), a characteristic- 
fimction argument shows that the asymptotic distribution of qn 
the same as if the variables ^n,i, • ■ ■ , Cn,g„ were independent. 

We apply the Lindeberg-Feller central limit theorem with Lyapounov's condi- 
tion. By Propositions 3.1, 3.2 and 3.3 applied for sample size n* — k*r together 
with the fact that Qn/kn ^ 1/fc* and qnkn/{n — ''n + 1)^ ~ l/(^n^n)i have 

var(^„.i) a^Sa, n — s- 00. 

Lyapounov's condition finally requires that there exists 5 > such that 

We will show that Lyapounov's condition holds for 2 + 5 = p with p as in (4.1). 

For /i 5^ 1, integer m ^ 1 and real numbers ci, . . . , c™, we have | YlT=i ^ 
j^M-i ^™ ^ |ci|^ (proof by Jensen's inequality). As a consequence, it is sufficient 
to show that for v G {1, 2, 3}, 



2+i5i 
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Above we have chosen /c* in such a way that /c* oo and fc* = o(fc„) as 
n — > cxD. Now we reinforce the latter requirement to 



Then for w = 2, since {if^l ^ 1, we simply have 



(B.5) 



= 



QniK) 



* \2+S 



o 



*\1+S 



^-5/2 



0(1). 



For V g {1, 3}, we have to proceed a little differently. Let Ci,r be equal to ifl if 



V = 1 and Ni^r if w = 3. Then 



2+S' 



*\2+S 



,1+5/2 



E[|Cl,r, 



|2+<5l 



Now E[|Ci,r-„ P^*] — for D = 1 this is obvious and for w = 3 this follows 

by condition (4.1). Again, requirement (B.5) on A:* ensures that the right-hand 
side of the previous display is o(l) as n ^ oo. □ 

Appendix C: Proof of Proposition 5.1 

In view of Theorem 4.1 and the continuous mapping theorem, we only need to 
show that (T^^^(m„) is weakly consistent for var(iV). It is not difficult to see 
that, as n — > oo. 



Define 



Nn,r„{UrJ = 'rn,r„ (Wr„ ) + Op(l) = Tr^{Ur^) + Op(l). 
n—r 

n — r + 1 ^-^ ^ ' 



Then as n ^ oo 

^ir„K„) - 



2 /_ ^ n-2rn + l 



n-rn + 



Y^n,rS'^-^^) (^«,r„(Ur„) - Tr„(Ur„)) =Op(l). 



Hence we can proceed with (T^^(u) rather than with (T^^(w). By stationarity 
and the fact that E[A^r(w)] = rF{u) = rr(wr), we have 

E[CT^,r(u)] var(iVr(M)) = ct^('u). 
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By uniform integrability, the moment condition (4.1) implies (jf.{ur) — > am2 as 
r — > oo (1, Corollary p. 338). Hence we only need to show that var (ct^ ^ (u^^)) 
as n — > cxD. 

Put £,n,i {^i,i+r„(Mr„) " Tr„(Mr„)}^- Then 



C^n,r„(Wr„) = (n - r„ + 1) ^ ^ ^n,i 

and thus, writing k„ '■= var(^„^i), 
var(a-^^^„(u^„)) 

2 



Y)2 ^{n-rn + l- S) COv{£,n,0, £,n,s)- 



n-rn + l (n-r„ + l) 

The first term on the right-hand side converges to zero as n — ^ oo since k„ = 
0(1) by (5.2). The second term can be written as a sum /„ + //„ according to 
whether s < or not. The first term can be bounded by 

\In\ ^ , Kn ^ 0, n^OO. 

n-rn + l 

The second term is 

2 " 

\IIn\ < --r'^n/Pn,liUr„), 

n — r„ + 1 — ' 

1=1 

which, in view of (3.1), converges as n — > oo to zero as well. □ 
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